Topology agnostic hot-spot avoidance with InfiniBand

نویسندگان

  • Abhinav Vishnu
  • Matthew J. Koop
  • Adam Moody
  • Amith R. Mamidala
  • Sundeep Narravula
  • Dhabaleswar K. Panda
چکیده

InfiniBand has become a very popular interconnect, due to its advanced features and open standard. Large scale InfiniBand clusters are becoming very popular, as reflected by the TOP 500 supercomputer rankings. However, even with popular topologies like constant bi-section bandwidth Fat Tree, hot-spots may occur with InfiniBand, due to inappropriate configuration of network paths, presence of other jobs in the network and un-availability of adaptive routing. In this paper, we present a hot-spot avoidance layer (HSAL) for InfiniBand, which provides hot-spot avoidance using path bandwidth estimation and multi-pathing using LMC mechanism, without taking the network topology into account. We propose an adaptive striping policy with batch based striping and sorting approach, for efficient utilization of disjoint network paths. Integration of HSAL with MPI, the de facto programming model of clusters, shows promising results with collective communication primitives and MPI applications.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effective Methodology for Deadlock-Free Minimal Routing in InfiniBand Networks

The InfiniBand Architecture (IBA) defines a switchbased network with point-to-point links whose topology is arbitrarily established by the customer. Often, the interconnection pattern is irregular, which complicates routing and deadlock avoidance. Current routing algorithms for NOWs, either achieve a low network performance, such as the up*/down* routing scheme, or cannot be implemented on IBA ...

متن کامل

Dynamic Routing Balancing On InfiniBand Networks*

InfiniBand (IBA) technology was developed to address the performance issues associated with messages movement among Endnodes and computer I/O devices. However, InfiniBand is also widely deployed within high performance computing (HPC) clusters due to the high bandwidth and low message latency attributes it offers to inter-processor communication systems. An interconnection-network efficient des...

متن کامل

A Backbone-Aware Topology Formation (BATF) Scheme for ZigBee Wireless Sensor Networks

In a tree-structured ZigBee wireless sensor network, nodes close to the root of the tree (i.e., hot-spot nodes) may exhaust their power earlier than those distant from the root due to heavy loads on packet forwarding. This hot-spot problem is inherent in tree-structured networks and may demand extra energy to recover from failures of hot-spot nodes. In this paper, the backbone-aware topology fo...

متن کامل

An Efficient Backbone Aware Shortest Path Selection Protocol in Zigbee Wireless Networks

The zigbee tree routing is wide used in several resource restricted devices and applications. Since it doesn’t need any routing table and route discovery overhead to send a packet to the destination. But the ZigBee tree routing has the elemental limitation that a packet follows the tree topology. Therefore, it cannot provide the optimum routing path. The shortcut tree routing protocol that prov...

متن کامل

Reducing hot-spot contention in shared-memory multiprocessor systems

In parallel systems it is possible for several processors to request concurrent access to a shared data structure such as a synchronization variable. Such an access pattern causes what is known as hotspot contention. In shared-memory multiprocessor systems that use a multistage interconnection network, hot-spot contention may result in "tree saturation" that degrades the system performance. It ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Concurrency and Computation: Practice and Experience

دوره 21  شماره 

صفحات  -

تاریخ انتشار 2009